Shape invariant time-scale modification of speech using a harmonic model
Abstract
A new and simple approach to shape invariant time-scale modification of speech is presented. The method, based upon a harmonic coding of each speech frame, operates entirely within the original sinusoidal model [3] and makes no use of the "pitch-pulse onset times" used by conventional algorithms. Instead, phase coherence, and thus shape invariance, are ensured by exploiting the harmonic relation existing between the sine waves to cause them to be in phase at each adjusted frame boundary. Results suggest this approach to be an excellent candidate for use within a concatenative text-to-speech synthesiser [2], where scaling factors typically lie within a range well handled by this algorithm.
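The phase-coherence idea in the abstract can be illustrated with a small sketch. It is not the paper's algorithm: it assumes a single frame with a constant fundamental frequency, and the names `advance`, `relative_phases`, and `boundary_shift`, along with all numeric values, are hypothetical. It shows two things that follow from strictly harmonic frequencies: stretching a frame leaves the phase of each harmonic relative to the fundamental unchanged, and one small time shift of the frame boundary that aligns the fundamental also aligns every harmonic.

```python
import math

def wrap(phase):
    """Wrap a phase in radians to the interval [-pi, pi]."""
    return math.atan2(math.sin(phase), math.cos(phase))

def advance(phases, f0, duration):
    """Advance harmonic k (k = 1, 2, ...) by 2*pi*k*f0*duration radians."""
    return [wrap(p + 2 * math.pi * k * f0 * duration)
            for k, p in enumerate(phases, start=1)]

def relative_phases(phases):
    """Phase of each harmonic relative to the fundamental: theta_k - k*theta_1."""
    theta1 = phases[0]
    return [wrap(p - k * theta1) for k, p in enumerate(phases, start=1)]

def boundary_shift(running_theta1, next_theta1, f0):
    """Time shift in seconds that brings the next frame's fundamental
    into phase with the running (time-scaled) fundamental."""
    return wrap(running_theta1 - next_theta1) / (2 * math.pi * f0)

# Hypothetical frame: all values below are illustrative assumptions.
f0 = 120.0                    # fundamental frequency in Hz, constant in the frame
frame_len = 0.010             # original frame duration in seconds
rho = 1.3                     # time-scale factor
start = [0.4, -1.1, 2.0, 0.7] # phases of harmonics 1..4 at the frame start

end_original = advance(start, f0, frame_len)         # phases at the original boundary
end_stretched = advance(start, f0, rho * frame_len)  # phases at the stretched boundary

# Stretching preserves the relative phases, and hence the waveform shape.
shape_before = relative_phases(start)
shape_after = relative_phases(end_stretched)

# One shift of the boundary, computed from the fundamental alone,
# brings every harmonic of the original boundary into phase.
tau = boundary_shift(end_stretched[0], end_original[0], f0)
realigned = advance(end_original, f0, tau)
```

The shift `tau` never exceeds half a pitch period in magnitude, which is why adjusting the boundary in this way only perturbs the requested scaling factor slightly. Real speech frames have time-varying pitch and non-harmonic components, which this sketch ignores.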
Similar resources
Shape invariant pitch modification of speech using a harmonic model
We present a simple but effective approach to pitch modification of speech based on a harmonic model. Building on our time-scaling algorithm [1], pitch modification applies to a harmonically coded glottal wave estimate derived via a simple inverse filtering technique [3]. The modified glottal wave subsequently serves as input to an LPC vocal tract filter and the pitch-scaled speech is generated. Shape ...
Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis
To preserve shape-invariance when pitch or time-scale modifying sinusoidally modelled voiced speech, the phases of the sinusoids used to model the glottal excitation are made to add coherently at estimated excitation points. Previous methods achieve this by estimating excitation phases at synthesis frame boundaries, disregarding the frequency modulation that may occur between the frame boundary...
Conformal spherical representation of 3D genus-zero meshes
This paper describes an approach to representing 3D shape using a set of invariant Spherical Harmonic (SH) coefficients after conformal mapping. Specifically, a genus-zero 3D mesh object is first conformally mapped onto the unit sphere using a modified discrete conformal mapping, where the modification is based on Möbius Factorization and is aimed at obtaining a canonical conformal mappin...
High-Quality Speech Modification Based on Pitch-Synchronous Harmonic and Non-harmonic Modeling of Speech
In this paper, we propose a high-quality speech modification method based on pitch-synchronous harmonic and non-harmonic modeling of speech. In the proposed method, the harmonic and non-harmonic parts of speech are modeled by the sum of sinusoids with frequencies corresponding to pitch multiples and with randomized frequencies, respectively. Then, harmonic and non-harmonic parts are synthesized ...
Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems
This article examines methods for optimizing the performance of GMM-based voice conversion systems, taking the GMM method as the baseline to be improved. In current methods, because a single conversion function is used to convert all speech units, and because statistical averaging causes spectral smoothing, we will observe quality ...